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^ -7 ^' BACKGROUKD OF THE INVENTION 



1 . Field of the Invenlion 

The present invention relates to the discovery "of the human survival 
motor-neuron gene or SMN gene vs^ich is a chromosome 6-SMA (Spinal 
Muscular Atrophy) determining gene. The present invention further relates to the 
nucleotide sequence encoding the SMN gene and corresponding amino acid 
sequence, a vector containing the gene encoding the SMN protein or a DNA 
sequence corresponding to the gene and transformant strains containing the 
SMN gene or a DNA sequence corresponding to the gene. 

More particularly, the present invention relates to means and methods for 
detecting motor neuron diseases having symptoms of muscular weakness with or 
without sensory changes such as amytrophic lateral sclerosis (ALS), spinal 
muscular atrophy (SMA), primary lateral sclerosis (PLS), arthrogryposis multiplex 
congenita (AMC), and the like. The methods for detecting such motor neuron 
diseases include, but are not limited to, the use of specific DNA primers in the 
PGR technique, the use of hybridization probes and the use of polyclonal and 
monoclonal antibodies. 

Even more particularly, the present invention relates to the use of the 
human SMN gene or part of the gene, cDNA. oligonucleotide or the encoded 
protein or part thereof in therapy by insertion of the human SMN gene or part of 
the gene. cDNA, oligonucleotide or the encoded protein or part thereof, if 
required, into engineered viruses or vectors that serve as harmless carriers to 
transport the gene or part of the gene, cDNA. oligonucleotide or the encoded 
protein or part thereof to the body's cells including bone marrow cells. 
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The invention 

gene. 



further relates to antigen sequences directed to the SMN 



In order to provide means for the therapy of motor neuron diseases, the 
invention also relates to the protein encoded by the SMN gene. 

The present invention also relates to the isolation of the mouse SMN 
gene the nucleotide sequence encoding the mouse SMN gene and 
corresponding amino acid sequence. A transgenic mouse model that 
hyperexpresses all or part of the SMN gene and a transgenic mouse model 
produced by homologous recombination with a mutated SMN gene .s also 
described. 



State of the Art 



Degenerative motor neuror. diseases can be placed into three major 
categories Amyotrophic lateral sclerosis or ALS. motor neuron diseases such as 
spinal muscular atrophy (SMA) and motor neuron diseases associated with other 
degenerative disorders such as primary lateral sclerosis (PLS). 

Amyotrophic lateral sclerosis (ALS) is the most frequently encountered 
form of progressive neuron disease and is characteristically a disorder of middle 
age The disease is characterized by progressive loss of motor neurons, both in 
the cerebral cortex and in the anterior horns of the spinal cord, together with their 
homologues In some motor nuclei of the brainstem. It typically affects both upper 
and lower motor neurons, although variants may predominantly involve only 
particularly subsets of motor neurons, particularly eariy in the course of illness. 

ALS is evidenced by the development of asymmetric weakness, with 
fatigue and cramping of affected muscles. The weakness is accompanied by 
visible wasting and atrophy of the muscles evolves and over time, more and 
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„or. nu,s;es becon,o lavCved un.i, me disorder «Kes on 3 symmetric 

dl...bu.-,on io a., reslons. <,«.ud>n« musCes of chewing. 

movement of the face and tongue. Fifty percent o. patients having ALS can t« 

expected to die *i«>ln three to Ave years from the onset of the d.sease^ 

Presently, there is no treatment that has Influence on the pathologic process of 

ALS. 

Spinal muscular atrophies (SMA) are characlerized by degeneration of 
anterior horn cells of the spinal cord leading to progressive symmetrical limb and 
trunk paralysis associated with muscular atrophy. SMA represents the second 
most common fatal, autosomal recessive disorder after cystic fibrosis (1 out 6000 
O newborns) Childhood SMA is classically subdivided into three clinical groups on 

:» the basis of age of onset and clinical course. The acute form of Werdn.g- 

S Hoffmann disease (Type 1) is characterized by severe generalized muscle 

^ ■ weakness and hypotonia at birth or in the 3 months following birth. Death, from 

S respiratory failure, usually occurs within the first two years. This disease may be 

distinguished from the intermediate (Type II) and juvenile (Type 111. Kugelberg- 
Welander disease) forms. Type II children were able to sit but unable to stand or 
5 walk unaided, and they live beyond 4 years. Type III patients had proximal 

a muscle weakness, starting after Ih. age of two. The underlying biochemical 

% defect remains unknown. In addition there is known to exist a slowly evolving 

03 adult form of SMA. sometimes referred to as SMA IV. 

Primary lateral sicerosis (PLS) is a variant of ALS and occurs as a 
sporadic disease of late life. Neuropathologically in PLS there is a degenerat.on 
of the corticospinal (pyramidal) tracts, which appear almost normal at brainstem 
levels but become increasingly atrophic as they descend through the spinal 
column. The lower limbs are affected earliest and most severely. 

Arthrogryposis Multiplex Congenita (AMC) is a frequent syndrome 
characterized by congenital joint fixation (incidence of 1 out of 3000 live births) 
resulting from decreased fetal movements in utero (Stern. W.G.. JAMA. 81:1507- 



tiU I nHNN-KLHbiitKHUU I" HA : iO- 1 -^l^bOUOyU ^p.ULl.yD 11 NO.UU;^ K . Uta 



1510 (1923) ; Hall. J.G.. Clirv Orthop.. 194:44-53 (1985)). AMC has been 
ascribed to either oligo-hydramnios or a variety of diseases involving the central 
nervous system, skeletal muscle, or spinal cord. Since neuronal degeneration 
and neuronophagia occur in the anterior horns, it has been hypothesized that the 
AMC of neurogenic origin could be related to acute spinal muscular atrophy; 
SMA Type I Werdnig-Hoffman disease (Banker. B.Q.. bum. Pathol., (1986); 
117:656-672). 

The detection and clinical diagnosis for ALS. AMC. SMA and PLS is quite 
limited to muscle biopsies, the clinical diagnosis by a physician and 
electromyography (EMG). For example, the clinical criteria for diagnosing SMA is 
set forth in the Clinical Criteria International SMA Consortium (Munsat T.L.. 
Neuromuscular Disorders. Vol. 1. p. (1991)). But due to the complications of 
the various tests to detect motor neuron disorders, tHe clinician usually attempts 
to eliminate various categories of other disease states such as structural lesions, 
infections, intoxications, metabolic disorders and heriditary biochemical 
disorders prior to utilizing the above-described test methods. 

Presently there is no treatment for any of the above-mentioned motor 
neuron disorders. Basic rehabilitative measures, including mechanical aids of 
various kinds, may help patients that have these diseases overcome the effects 
of their disabilities, but often confining respiratory support systems are necessary 
to have the patient survive longer. 

Accordingly, it is an object of the present invention to characterize the 
SMA gene which is responsible for SMA disorders and to clone the SMA gene 
into a vector, for example a plasmid. a cosmid. a phage, a YAC vector, that can 
be used in the transforniation process to produce large quantities of the SMN 
gene and SMN protein. 



In yet another aspect of the invention is the use of primers and 
hybridization probes to detect «nd diagnose patients having motor neuron 
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disorders such as AMC. ALS. SMA and PLS. Yet another aspect of the present 
invention is the use of the SMN gene or part thereof or cONA. oligonucleotides, 
protein or part thereof in therapy to correct disorders present in. for example 
AMC. SMA. ALS and PLS patients, especially gene disorders. 

in yet another aspect, the present invention provides monoclonal and 
polyclonal antibodies for detection of SMN gene defects in SMA patients. 

Another object of the present invention provides the characterization of 
the SMA gene in the mouse. A transgenic mouse model is presented that 
hyperexpresses all or part of the SMN gene or a transgenic mouse that by 
homologous recombination with a mutated mouse SMN gene produces 
abnormalities in the SMN gene is also described. 

According to a furti.er aspect of the invention, the therapy of motor neuron 
diseases can involve the protein encoded by the SMN gene. 

These and other objects are achieved by the present invention as 
evidenced by the summary of the invention, the description of the preferred 
embodiments and the claims. 



OBJECTS AND S UMMARY OF THE INVENTION 

It is an object of the present invention to provide a novel human Survival 
Motor Neuron gene or SMN gene, its DNA sequence and amino acid sequence. 



Another aspect of the present invention provides a novel mouse Survival 
Motor Neuron gene or SMN gene, its DNA sequence and amino acid sequence. 
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Yet another aspect of the present invention is the provision of a vector 
which is capable of replicating in a host microorganism to provide large 
quantities of the human or mouse SMN protein. 

Yet another aspect of the present invention is the provision of specific 
DNA sequences that can be used to detect and diagnose spinal muscular 
atrophy and other motor neuron disorders. These DNA sequences can be used 
as primers in the polymerase chain reaction to amplify and detect the SMN gene 
sequence, a truncated or mutated version of the SMN gene sequence or lack of 
said sequence which leads to the diagnosis of SMA. AMC. and other motor 
neuron disorders. 

Yet another aspect of the present invention provides a transgenic mouse 
that hyperexpresses all or part of the SMN gene or a transgenic mouse that by 
homologous recombination with a mutated mouse SMN gene produces 
abnormalities in the SMN gene is also described. 



>^ The investors have identified two genes respectively designated T- 
BCD541 and C-BCt\541 . which are involved in motor neuron disorders. 



The T-BCDb41 gene Is responsible for the motor neuron diseases of the 
SMA type, since its\^ration either by partial or total deletion, by mutation or 
any other modification/^ufficient to lead to a pathological state at the clinical 
electromyographic or musci^fnorphological levels. 

Ol UV^The C-BCD54l\^e is different from the T-BCD541 gene, at the level of 
^^^(f^e cDNA, since two iWleotides are modified. This C-BCD541 gene is 
nevertheless not correctly ^K^ssed during the transcription in controls and 
patients suffering from motof\euron diseases. The genomic DNA of the 
C-BCD541 gene is not correctly s^H^ced during the transcription providing thus 
for an abnormal transcript. The differehce between the splicing of the T-BCD541 
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and the ^C0541 gene results from differences in the sequence of the introns 

of these gertes. 



^ ornar 



H> The preV characterizes the structure and 

o^^nization of thl\,man SMN gene which was found to be approximately 20 kb 
in length and cons\ of 9 exons interrupted by 8 introns. The nucleotide 
sequence, amino acidWence as well as the exon-intron boundaries of the 
human SMN gene is set in Figure 10. All exon-intron boundaries display the 
consensus sequence found iNsolher human genes. A polyadenylation consensus 
site is localized about 550 bp dWnstream from the stop codon (Figure 10). The 
entire intron/exon structure of the^MN gene permits the characterizations of the 
SMN gene mutations in SMA diseas^S^other motor neuron diseases. 

SJ<y The presen\.nvention also defines means for the detection of genomic 
^formalities relati\to motor neuron diseases at the level of the T.BCD541 
gene or at the level of <l^ C-BCD541 gene. 

The genes of the invention can be further defined in that each of them 
comprise intronic sequences corresponding to the following sequences : 

In the T-BC0541 gene 
- for intron n" 6 : 



5 « AATTTTTAAATTITTTGTAGAGACAGGGTCTCATTATGTTGCCCAGGGTG 
GTGTCAAGCTCCAGGTCTCAAGTGATCCCCCTACCTCCGCCTCCCAAAGTTGT 
GGGAITGTAGGCATGAGCCACTGCAAGAAAACCTTAACTGCAGCCTAATAATT 
GTTTTCTTTGGGATAACTTTTAAAGTACATTAAAAGACTATCAACTTAATTTC 
TGATCATATTTTGTTGAATAAAATAAGTAAAATGTCTTGTGAACAAAATGCTT 
TTTAACATCCATATAAAGCTATCTATATATAGCTATCTATGTCTATATAGCTA 

TTa-TTTTTAACTTCCTTTTATTTTCCTTACAG 3' f '^V I ) 



fox: 3js-i-a:i'bbu«yu ^re. 
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- for intron n* 7 : 

GTTGA«GGTTAATOTiAAACAATCAAT*^ ^^^^ 



TGGTTATGTGGT^GAAACAii^ ACTTTTTTTTATTGTGATATGG 
CATaT-rATATCACTAGTACGCAGACCA^^^^^ 

^O^X-rAACTCGTG.rCCACACAG<,ACATGGTT^^^^^^ 
TGGTTCTAATTTCTCATTTGCAG 3-^(^^6tji ^ 



In the C-BCD541 gene : 
- for inlron n" 6 : 

Kn.GACAGGOTCTCATTATGTTGCCCAOGGTGG'rGTCAAGCTCCA 
^a-TTTXAAATXTTTTOTAGAGACA^^^^ 

GGTCTCAAGTGATCCCCCTACCTCCa^TCCCAA^^^^^^^^^^^ 
CAAOAAAACCTTAACTGC^GCCT^^^^^ 

CAAAATGCTXTTTAACAXCCATAXAAAGC.^^^^^^ ^ - 

TTTTTTTTAACTXCCT-PXTATTTTCCTTACAG* ^ C> ertV > >/ 

- for intron n° 7 : 

»GTAAGTCTGCCAGCATXATGAAAGTGAATCTTACTTTT 

^^^^^.'i.iU.ACAAATGTXXXTGAACAGTTAAAAAGTTCAGATGTTAra. 
GXAAAACXXXAXGGXXXGXOGAAAACAAA^^^^^^ 

AAOXTGAAAQGXTAATOXAAAACAAXCAATAXX ^^^^^^^^^^^ 
„ACTIICACAAT^«^GAGCWXAGG ^^^^ ^^^^^ 



C.CACTTXX.T.TAI.OTGATXTGGGAX*ACCTAaG.^«TG^^ 
OAAGTGCXCT*GTCAXGTTT«0T0OTGT=C.CAGAG<^«TGOrTX^^ 

CCCTCTGOaXCTAXTTTCTCATITaCAOY f ^ fed r^T? M*- 
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In a preferred embodimer.1 of the invention, the gene of the '"^^.^Kg^is^^ 
. capable of hybridizing in stringent conditions with the sequence of Figure Soused 

^ as probe. 



As hereabove writlen, the invenUon further relates to a variant of the SMN 
gene. «h,ch variant is a C-BCDS^^ene having^ _a_^DNA sequence 
corresponding to the sequence of Figure 2^ 



The invention also relates to cDNA sequences such as obtained from one 
of the above genes. Such cDNA sequences are disclosed in Figures 2 and 3. 
□ Both of these CDNA sequence are capablp ^'^^^^ogTWr^'f f " ^"^^^^'^^ '"^^ 

S£7 amino acid sequence described on Figure 1^ 

ifi Despite this capacity to encode for such a protein, the inventors have 

i D noted that the C-BCD541 gene is^ble to produce inviyo this protein or is not 

S able to produce it in a sufficient quantity due to the abnormal splicing of the gene 

during the transcription. Thus, the presence of the C.BCD541 gene does not 
^ enable to correct in viv Q the deficiency (deletion, mutation,...) of the T.BCD541 

H gene responsible for the motor neuron diseases of the SMA type or other motor 

■Q neuron disorders. 



In a part\(tar embodimenl. the invention relates also to a nucleotide 
^quence comprisiiW«ucleotides 34 to 915 of the sequence of Figure 3. or to a 
sequence comprising nbdeotides 34 to 915 of the sequence of Figure 2. 

l^?^These nucleohde sequences correspond to the coding sequence of 
^^^^pectively the T-BCDb^l gene and C-BCD541 gene. 

The introns of the hereabove described genes are also included in the 
application. Especially introns 6 and 7 have respectively the following 
sequences : 



10 



ill 



For me T-8C0541 gene : 
- intron 6 : 

:GGIS^T"G^XGIGCCKeTGC^A=^AAACCTT«CTGC^CCCT«^^^^ 
:;™rGGGATAAeTa..AAAGTACATTAAAAGACTAT«ACT™ 

TTI-lWTAACTTCCTTTTATTTTCCTTACAG ^ 



!=j - Intron 7 : 



5 . GTAAGTCTGCCAGCAI-TATGMAGTGAATCTTKCTTa-rGTAAAACITTAT 
LT^^TCGAAAACAAATGTTmGAACAGTrAAAAAGTTCAGATGTTAAAAA 
!^III^G^^TGTAAAAC«.TCAATATTAAAOAA«TTGAT=CCAAAACT 
^rAlI^TTAATCTACATCCCTACTAGAATTCTCATACTXAACTGGT 
^^G^AAGAAACATACnTTCACAATAAAOAGCTTTAGGATATOATGC 
^^™CACrAGTAGGCAGACCAGCAGACTTT.TTTTATTGTCATATGG 
^t^AC^ALcATACTGCACTGTACACTCTGACATATGAAGTGCTCTAGTCA 

rrAr^CCACAGAGGACAXGG^A^^^^ 



For the C-8CD541 gene : 
- Intron 6 : 

«^^a.n«.TTATCTTGCCCAaaGTGG'rQTCAAGCTCCyV 
;,;,TTTT^AAATTXT'rTGTAGAGACAGGGTCT^^ 

aGTCTCAAGTGATCCCCCTACCTCCOCCTCCCAA^^^^^^^^ ^^^^^ 
CAAGAAAACXTTAACTGCAGCCTAATAATTGTTTX^^^^ 

CAAAATGCTTTTTAACATCCATATAAAGCTATCXA 
^ TTTTTTTTAACTTCCXTTTAT'XTTCCTTACAG* C>^« ^ 



n 



- Intron 7 : 



GTAAGTCTGCCAGCATTATGAAAGTGAATCTTACTT'rT 
GTAAAACTTTATGGTTTGTGGAAAACJUUVTGTTXTTGAACAOXTAAAAAGTTCAGATGTTA^ 
AAGTTGAAAGGTTAATOTAAAACAATCAATATTAAAGAATTTTOAXGCCAAAACTATXAGATA 
AAAGGTTAATCTACATCCCTACTAGAATTCTCATACXTAACVGGTTGGTa;GT 
ATACTTTCACAATAAAGAGClTTAGGATATGATGCCATTTTATArCACTAGTAGGCAGACCAG 

CAGACTa*TTTXTTAT'rGTGATATGGGATAACCTAGGC:ATACTGfl\CTGTACACT^ 
GAAGTGCTCXAGTriAAaTTTAACTGGTGTCCACAGAGGACATGGlXTAACTGGAATTC^ 

GCCTCTGGTTCTAAT'rXCTCATTXGCAG* ^ C ^^(2 ^ >^ - 

=13 The invention further encompasses a nucleotide sequence, characterized 

j!^ in that it comprises at least around 9 nucleotides and in that it is comprised 

ij§ within a sequence v^ich has been described above or in that it hybridizes with a 

jjf sequence as described above in hybridization conditions which are determined 

m after choosing the oligonucleotide. 

^1 For the determination of the hybridization conditions, reference is made to 

C3 the hybridization techniques for oligonucleotides probes such as disclosed in 

=1; Sambrook et al. Molecular Cloning, a Laboratory ManuaL 2nd edition. 1989, 

The sequences of the invention are either ONA (especially genomic DNA 
or cDNA or synthetic DNA) or RNA. They can be used as probes for the 
detection of the T-BCD541 or C-BCD541 genes or as primers for the 
amplification of genomic DNA present in a biological sample. 

Preferred primers are thov;e comprising or relating to the following 
sequences : 

\^ a) 5' AGACTATCAACTTAATTTCTGATCA 3'(R111)^ 

b) 5' TAAGGAATGTGAGCACCTTCCTTC 3' (541C770) 9 ^z^A^.^ 

6> ^ 



FRX: 33-l-42bbU«yU ^re. 
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The above primers are characteristic of exon 7 of the ^-BCOMI gene. ^ ^ ^ 
^ (c) GTAATAACCAAATGCAATGTGAA (541 C960) C 55^ ^ 

(d) CTACAACACCCTTCTCACAG (541 C1 1 20) ^ C ^ ^CP ^ V 

^ The above primers are characteristic of exon 8 of the T.BCD541 gene. 

The primers used by pairs can form sets for the amplification of genomic 
DNA in order to detect motor neuron diseases. 

Inverted complementary sequences with respect to the above primers can 

□ also be used. 

^ Preferred sets of primers are the fbllov/ing : 

-J J 

i - a pair of primers contained in the sequence comprising nucleotides 921 

II j to 1469 of the sequence of Figure 3 and/or 

:U - a pair of primers comprising the following sequences . ^.p /vO : ^ i 

^ 6' AGACTATCAACTTAATTTCTGATCA 3^^ 



5' 



TAAGGAATGTGAGCACCTTCCTTC 3" C'^^^^^'t^ 



m Another preferred set of prime rs comprises : 

^ - a pair of primers having ths following sequences : :pP = 

^ 5' AGACTATCAACTTAATTTCTGATCA y ^ Z^) 

V 5' TAAGGAATGTGAGCACCTTCCTTC ^ 

- a pair of primers having the following sequences : ^^-^ t^V ^) 

5' GTAATAACCAAATGCAATGTGAA 3' ^ and/or ^ . 

^ 5- CTACAACACCCTTCTCACAG 3' f 9^ 
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From a general point of view for the detection of divergence in exon 7, 
between the T'BCD641 and C-BCD541 genes oligonucleotide primers can be 
selected in the fragment 5' from the divergence and within exon 7 or intron 7. 

Other primers that can be used for SSCP analysis for diagnostic purposes 
are selected from amongst the following : 



3> 
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5 'EXON 1 

121MD 
121ME 

EXONl 

12 IMA 
12 IMF 

EXON2A 

EX2A5 

EX2A3 

EXON2B 

EX2B5 
EX23 

EXON3 

SM8C 
161CR2 



SM8D 
541C260 



12l2Cid/121xiie 



Size: 17 0 bp 



3' AGG GCG AGO CTC TOT CTC AV 
5' COG GAG GAC CGC TTG TAG T (C^^Sl ^9 



121ma/121mf Size: 180 t>p 

5 ' GCC GGA AGT CG*V CAC TCT T 
5 



' GGG TGC TGA GAG CGC TAA TA C^^trtS "^P 



ex2A5/Ex2A3 Size: 242 bp 

5' TGT GTG GAT TAA GAT GAC TC 



5' CAC TTT ATC GTA TGT TAT C ( g'trtP :J^P 



£x2n5/£:X23 



Size: 215 bp 



CTG TGC ACC ACC CTG TAA CAT G 



5' AAG GAC TAA TGA GAC ATC ^-^[^ 



6M8C/161CR2 Size: 238 bp 



^ fSStP 

5' AG CTA CTT CAC AGA TTG GGG AAA G( T ]? A^'^ 



S' CGA GAT GAT AGT TTG CCC TC 



SM8D/C260 



Size: 150 bp 



5' CTC ATC TAG TCT CTG CTT CC ^ v 
5 ' TGG ATA TGG AJVA TAG AGA GGG AGC f S^CP ^ 
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1^ 



EXON4 

SM3CA 
541C460 



SM3CB 
541C380 

EXON5 

EX55 
541C590 



EX53 
541C550 

EXON6 

164C25 
S41C849 



EX63 
541C618 

EX0N7 

Rill 

541C770 



Rill 
164C261 



SM3CA/C460 Sizo:lSObp /^<'gr(i ""^ ^ 

5 ' CAC CCT TAT AAC AAA AAC CTG C ^ j 

5' GAG AAA GGA GTT CCA TGG AGC AG ^ J 5^ y^p AA!>^'^i) 

eM3CB/C380 Size: 180 bp (-C^^^O^'^^ 

5' GAG AGG TTA AAT GTC CCG AC 



5' GTG AGA ACT CCA GGT CTC CTG G 



IT' 



EX55/C590 



Size: 254 bp 



TGA GTC TGT TTG ACT TCA GG 



5 ' GAA GGA AAT GGA GGC AGC CAcT C ( C,-^\S "^Q^' ^ 

r\ 

KX53/CS50 Size-. 168 bp r'^^'^^ ^'^V 

5 ' TTT CTA CCC ATT AGA ATC TGG ^ 

5' CCC CAC TTA CTA TCA TGC TGG CTG J^V ^ ' ^ 



A 



164C25/C849 



Size t 143 bp 



5' CCA GAC TTT ACT TTT TGT TTA CTG 



ATA GCC ACT CAT GTA CCA TGA 



'(^-^ '^l) 



EX63/C618 Si2e:248 bp 

5' AAG AGT AAT TTA AGC CTC AGA CAG ^ 
5' CTC CCA TAT GTC CAG ATT CTC TTG 3 

R111/C770 Size, 200 bp ^^-^V^^W- 

S' AGA CTA TCA ACT TAA TTT CTG ATC A 
5' TAA GGA ATG TGA GCA CCT TCC TTC ^^^^ ^p^O'-^ ) 



R111/C261 



Size:244 bp TC^^'^'^^ 
5 ' AGA CTA TCA ACT TAA Tl'T CTG ATC A 
5 ' GI^A AGA TTC ACT TTC ATA ATG CTG ^ 
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INTR0N7 

164C45 
164C265 

EXON8 

541C960 
541C1120 



164C140 
541C920 



164C45/164C265 Size: 220 bp ^ .^^j^ A/l) = ^3^. 

5 ' CTI' TAT GGT TTG TGG AAA ACA 3 |' 

5' GGC ATC ATA TCC TAA AGC 'TC^^g^jrp aa)- ^ ^ ? 

A 

C960/C1120 Sizet 186 bp rcj^rTQ XU M?'^^^ 

5 ' GTA ATA ACC AAA TGC AAT GTG AA ( >^ 



5 ' CTA CAA CAC CCT TCT CAC AG ^ c^^r^t 



164C140/C920 C 3^0 jn 

5 ' GGT GTC CAC AGA GGA CAT GG ^ ^ ^ ^ ^ >^ 



5' AAG AGT TAA CCC ATT CCA GCT 'YQQ. ^ ^ 



The invention also concerns antisense DNA or RNA. capable of 
hybridizing with the C-BCD541 gene and particularly to the intron sequences, 
especially with the fragment of the introns which differ from the correspond.ng 
part in the T-BC0541 gene. 

The invention also relates to a protein comprising the amino acid 
sequence of Figure 1. or to a protein having the amino acid sequence of 
Figure 8. 

The protein relating to the sequence of Figure 1 can be used in a 
composition for the treatment of motor neuron diseases, via oral, intra-muscular, 
intravenous administration, or via administration in the spinal cord fluid. 

The invention further provides a kit for the invitro diagnosis of motor 
neuron diseases, comprising : 

- a set of primers as described above ; 

- reagents for an amplification reaction ; and 

- a probe for the detection of the amplified product. 
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According to another embodiment of the Invention, a kit for the detection 
of the motor neuron diseases containing a hybridization probe as described 
above is provided. 

Oligonucleotide probes corresponding to the divergences between the 
genes can be used. 

The diagnosis can be especially directed to SMA motor neuron pathology. 

The invention also concerns cloning or expression vectors comprising a 
nucleotide sequence as defined above. Such vectors can be. for example, 
i plasmids, cosmids. phages. YAC. pYAC. and the like. Preferably, such a vector 

M has a motor neuron tropism. Especially for the purpose of defining means for 

^ gene therapy, it can be chosen among poliovirus vector, herpes virus. 

□ adenovirus, retrovirus vectors, synthetic vectors and the like. 

Within the scope of the Invention are contemplated further recombinant 

□ sequences. The invention also concerns recombinant host cells, i.e.. yeasts, 
n CHO cells, baculovirus. bone marrow cells. E. coU, fibroblasts-epithelial cells. 

transformed by the above recombinant sequences. 



The invention also relates to a method for detecting motor neuron 
disorders including spinal muscular atrophy, amyo trophoc lateral sclerosis and 
primary lateral sclerosis, said method comprising the steps of : 

(a) extracting DNA from a patient sample ; 

(b) amplifying said DNA with primers as described above ; 

(c) subjecting said amplified DNA to SCCP ; 

(d) autoradiographing the gels ; and 

(e) detecting the presence or absence of the motor neuron disorder. 



# 0 

FHX: AS-i-Q-^bbliU'^v Ty. 



GUTMftNN-PLflSSERflUD ' FHX: 55-l-^»2bbU»yu T^^^ul i . 11=30 no.uuz r.io 

Steps (c) and (d) can be replaced by a step of digestion with BsrI enzyme 
or with any other enzyme capable of recognizing specifically the divergence of 
the genes or mismatches in genes, or by sequencing. 

The invention also relates to a method for delecting spinal muscular 
atrophy, said method comprising the steps of : 

(a) extracting DNA from a patient sample ; 

(b) hybridizing said DNA with a DNA probe comprising all or part of the 
cDNA sequence of Figure 3 or of Fifiure 2 under stringent conditions; and 

(c) detecting the hybrids possibly formed. 

□ The invention also relates to a method for detecting arthrogryposis 

g multiplex congenita, said method comprising the steps of : 
b (a) extracting DNA from a patient sample ; 

:| (b) amplifying said DNA via PGR using unlabeled primers from exon 7 

m and exon 8 of the SMN gene ; 

liJ (c) subjecting said amplifier* DNA to SCCP ; 

r3 (d) autoradiographing the gels ; and 

^ . (e) detecting the presence or absence of arthrogryposis multiplex 

nj congenita. 

=13 

^ Yet another method to detect arthrogryposis multiplex congenita concerns 

dinucleotide Repeat Polymorphism Analysis using genotyping markers C272 and 
C212 after PGR amplification. 

The present invention further concerns P°'^|2"?r> f '""^ ° W :x^> 
monoclonal antibodies directed to the pr£.iein of Figure \ protein of Figure 8 
^ protein of Figure 12^ 4 ^ 

Yet another aspect of the present invention is directed to the use of the 
entire or partial nucleotide sequence of SMN as a probe to detect SMA as well 



1- 
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as to indetify and clone genes related to SMN gene motor neuron in animals or 

organisms. 

Yet another aspect of the present invention is the use of the SMA protein 
to produce polyclonal and monoclonal antibodies, which antit>odies may be used 
to detect and diagnose SMA, 

^ J<-Vfy in an^er aspect, polyclonal rabbit antiserum were generated against 
f^Ji^i^eilo pepti^corresponding to the amino acid sequence of Figures 1. 8 and 
^ 12. including the a^no acid terminus and the carboxy terminus. 

^ Accordingly, in one of its process aspects, the present invention relates to 

P the detection of SMA in patients having SMA or related motor neuron disorders 

;°i such as AMC. ALS and PLS. 

i Yet another aspect of the present invention is to administer the SMN gene 

part thereof. cDNA or oligonucleotides to patients who are either lacking the 
!ii gene or have a genetically defective gene as such or after incorporation into 

n engineered viruses or vectors. 

i These and other aspects of the present invenUon will be discussed in 

detail below in the preferred embodiments of the present invention. 



BRIEF DE SCRIPTiONOF THE DRAWINGS 

^ Fig. iSthJ^f^i^r^o acid sequence of the SMN coding region of the clone 

T-BCD541. . 

i). R^i^ nucleotide sequence of the SMN coding region as well as the 

5- end's- flanking regions of clone C.BCD541 ; the coding region is underlined. 
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Fig. 28 contains the sequence starting from intron 6 up to exon 8 of the C- 
BCD541 gene. The underlined sequences are those of exons7 and 8. 
Sequences of introns 6 and 7 can be chosen as oligonucleotides to amplify the 
cDNA region allowing the distinction, within exon 7. between the T-BCD541 gene 
and the C-BCD541 gene. The position of the divergent nucleotides between the 
T-BCD541 and C-BCD541 cDNA are in italics. 

Fig 3k is the nucleotide sequence of the SMN coding region as well as 
the 5' and 3' flanking regions of clone T.BCD541. The coding sequences are 
underlined. The numbers of the exons are indicated on the sequence. Asteriks 
indicate the beginning of each exon. The nucleotides which are indicated in 
italics are those which differ between the C-BCD541 and the T-BCD541 genes. 
3 ^ C^tr» -J-^ 

}J fX Fig. 3B represents the sequence from intron 6 up to the end of exon 8 of 

3 the T-BCD54rgene. The sequence of exons 7 and 8 is underlined. 

Fiq 4 is the nucleotide sequences of the markers •Ga4^r^272r-&^^ 
^ AFMl57xd10. andC161^^^ ^ 

Fig. 5 represents various probes utilized in the present invention revealing 
several loci that the probes hybridized to. 

Fig. 6 represents the telomeric element containing the survival SMN gene. 

Fig. 7 represents the marked decrease of gene dosage with probe 
132SEII. mapping close to this. 

(/" Fig. 8 re^eS^ts'fhe l^ino acid sequence of the truncated SMN protein. 

Fig. 9 is a schematic representation of the genomic structure of the human 
SMN gene. The designations and positions of genomic clones are shown above 
the figure. L-132. L-5. and L-13 depict the genomic clones spanning the entire 
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SMN gene, while L-51 spans part of exon 1. Micro satellites and DNA markers 
are indicated above the genomic map. B. H. and E mean ML HindlU and 
EcoRI. respectively. C212. p322. C272. 13286.1 and C171 represent various 
ni^-avi«;R7 and 8 reoresent exons of the SMN and C- 

from exon 1 to exon 2A. ^ 

Fig. 10^tpre?e^i^ t'he nucleotide sequence and amino acid sequence of 
the entire hCman SMN gene including the introns and exons. Tg^^<Jgg 3 
nucleotide sequences are in upper case, with the corresponding amino acids^ 
Shown below that. The polyadenylaiion signal Is in bold face. Arrowheads 
indicate the position of the single base differences between SMN and C-BCD541 
genes in Introns 6 and 7 and exons 7 and 8. Italic letters indicate the position of 
the oligonulcoeitdes chosen for the detection of divergences in .ntron7. C) 
indicates the position of th^op.codon. 




Fig 11 ?^sents theVucleotide sequence upstream of the coding region 
of the human SMN gene and illustrates the presence of putative binding sites for 
the transcription factors of AP-2. GH-CSE2. DTF-1. E4FI. HINF-A. H4TF-1. 
IFN and Spl. Bold letters indicate the dinucleotide repeat (CA) corresponding to 
the C272 markers. \ 

Fig. 12 represents the Nucleotide and amino acid sequences of Mouse 
SMN cDNA. n indicates the position of the stop codon. 



Fifl 13 reoresents a comparative analysis of the amino 
^ human SMN?(above) and' mouse SMN ibelow). 



acid sequence of 

AOS. cf-'^sej 



Fig 14 illustrates the genetic analysis of family 6. Lane A shows evidence 
of inherited maternal deletion seen with the microsatellite marker C272 as the 
proband inherited only allele from the father. Lanes B and C represent SSCP 
analysis of PCR-amplif.ed exons 7 (lane B) and 8 (lane C) of SMN (closed 



GUTMfiNN-PLPSSERflUD h HX : ^6- 1 --q-^bbUb^U "^^u i . t>D ix-do nu.ou^ r . 

21 

arrowheads) and its centromeric copy (open arrowheads). "F" represents the 
father. "M" the mother, and "A" the affected infant. 

Fig. 15 illustrates the band shifts on single strand confirmation 
polymorphism (SSCP) analysis of the PCR amplified intron 7 and permitted 
indetification of SMN (dosed arrowheads) and its centromeric counterpart C- 
BCD541 (open arrowheads). 



=13 



nFTAILED D ESCRIPTION OF T HE PRE FERRED EM BODIMENTS OF THE 
INVENTION 

As used herein, the term "contig" means overlapping nucleotide 
sequences. 

Previous studies by means of linkagec analysis have shown that all three 
forms of spinal muscular atrophy map to chromosome 5q11.2-q13.3. (L.M. 
Brzustowicz et al. Nature. 3^. 540 (1990) ; J. Melkl et al. Nature. 345. 823 
(1990); J. Meiki et al. Uncet 336. 271 (1990). A yeast artificial chromosome 
(YAC) «)ntig of the 5q1 3 region spanning the disease locus was constructed that 
showed the presence of low copy-repeats in this region. Allele segregation was 
analyzed at the closest genetic loci detected by markers derived from the YAC 
contig (C212. C272 and C161) in 201 SMA families. These markers revealed two 
loci (C212. C272) or three loci on the 5q13 region (C161). Inherited and de novo 
deletions were observed in 9 unrelated SMA patients. Moreover, deletions were 
strongly suggested in at least 18% of SMA type I patients by the observation of 
marked heterozygosity deficiency for the loci studied. These results indicated 
that deletion events are statistically associated with the severe form of SMA. 

By studying all polymorphic DNA markers derived from the YAC contig. it 
was observed that the smallest rearrangement occured within a region bordered 
by loci detected by C161 and C2^2-C272 and entirely contained in a 1.2-Mb 
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YAC Clone 903D1. See, for example, French Patent Application No. 9406856 
incorporated herein by reference. 

The present invention characterized the small nested critical SMA region 
of about 140 Kb by a combination of genetic and physical mapping in SMA 
patients. This region suggested a precise location for the SMA gene and 
therefore, a limited region within which to search for candidate genes. The 
present invention identified a duplicated gene from the 5q13 region. One of them 
(the telomeric gene) is localized v/ilhin the critical region. Moreover, this gene 
was lacking in 213 out of 230 (92-2%j or interrupted in 13 out of 230 (5.6%) SMA 
patients. In patients where the teiomeric gene is not lacking or interrupted, 
deleterious mutations Indicated that this telomeric gene, termed survival motor- 
neuron (SMN) gene, is the chromosome 5 SMA-determining gene. 

The SMN gene was discovered using a complex system of restriction 
mapping, distinguishing the E"^*** from the E^" by Southern blot, and the 
determination of the differences between the E"^*' in SMA patients by genetic and 
physical mapping. After confirming the location of the SMN gene, a phage contig 
spanning the critical region of the telomeric element was constructed to identify 
specific clones containing the SMK ^^ene. 

Analysis of the SMN gene in SMA patients compared with those of normal 
patients revealed either the SMN gene was either lacking or truncated in 98% of 
SMA patients or had combined mutations not present in normal control patients. 

To identify a large inverted duplication and a complex genomic 
organisation of the 5q13 region, long-range restriction mapping using pulsed 
field gel electrophoresis (PFGE) or the YAC contig was performed. 

YACs were ordered by comparing their haplotypes with that of the human 

donor at the polymorphic loci detected markers C212. C272. C171 and C161 

(Fig. 4)^ (^-(%) 
A 
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The restriction enzymes Sacll. BssHII. Sfil. Eafll and Xhol were used to 
digest the YACs containing the telomeric loci detected by markers C212. C272. 
C171 and C161 (YAC clone 595C11). the centromeric loci detected by these 
markers (YAC clones 121B8. 759A3. 278G7) or both (YAC clones 903D1 and 
920C9). Lambda phage libraries of YACs 595C11. 121B8 and 90301 were 
constructed and subclones from phages containing markers C212 {p322), C272 
(132SE11). C161{He3). AFM157xd10(131xb4) and CMS1 (p11M1) were used as 
probes for PFGE analysis. Fig. 5 shows that probes 132SE11. 11P1 and p322 
revealed two loci, and probe He3 revealed 4 loci on the YAC contig. whereas 
probe 131xb4 revealed several loci on 5p and 5q13. The restriction map 
(Figure 6) showed that the 5q13 region contained a large inverted duplication of 
an element (E) of at least 500 Kt-. lermed E^" and E««" for the telomeric and 
centromeric elements, respectively. 

The PFGE analysis of SMA and control individuals revealed a high degree 
of variability of restriction fragments which hampered the distinghishment of E^" 
from the and the recognition of abnormal restriction fragments In SMA 
patients. 

In order to distinguish between the E^" and the E^'". a Southern blot 
analysis was then performed. The Southern blot was performed by the methods 
described in Sambrook et al. SUQUI 

More specifically. DNA from YAC clones, controls and SMA patients was 
digested with restriction enzymes Sacl. Kenl. 1^1. PsiL Pvyll. EfioRI. HlndlH. 
Bal ll and Xba l for Southern blotting and hybridized with clones 132SE11. 11 pi. 
He3, 131xb4 and p322 as probes. None of the probes except one (He3) detected 
a difference between the two duplicated elements. Three idindlil restriction 
fragments of 12. 11 and 3.7 Kb were detected by probe He3. A 12 Kb Hindlll 
restriction fragment was detected in YAC clones 754H5 and 759A3. indicating 
that this fragment corresponded to the most centromeric locus in the E" 



Cen 
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Conversely, a 11 Kb Hindlll fragment was detected in YACs clones 595C11. 
903D1 and 920C9 indicating that this fragment corresponded to a single locus 
on the E^". Finally, a 3.7 Kb Hind lll fragment was noted in non-overlapping YACs 
containing either E^" or E*=°". indicating that this fragment con^esponded to two 
different loci. Similar results were obtained with S^l and KbdI. The three 
restriction fragments delected by He3 were observed on the monochromosomal 
hybrid HHW105 (Oarlock, L.R. el al. Am. J. of Human Genet., 1985. Vol. 37. 
p. 839) and In 30 unrelated, healthy individuals, confirming that these fragments 
were not due to polymorphisms. The Southern analysis results allowed one to 
distinguish e"^" from the E*='" in both controls and SMA patients. 

13 Thus, once the E^*" from the E*«" was distinguished, it was necessary to 

;: determine the differences between the E™ in SMA patients and those of the 

i normal control. This was done by using genetic and physical mapping. This 

3 genetic and physical mapping identified genomic rearrangements in the 

!l telomeric element of E"^"* of SMA patients. 

i It was previously shown that 9 out of 201 (9/201) SMA patients displayed 

:i large-scale deletions encompassing either one or the two loci detected by 

y markers C212 and C272 on one mutant chromosome (J. Melki et al. Scienc e. 

S 264. 1474 (1994)). On the other hand. 22 out of 30 (22/30) patients born to 

consanguineous parents including 13 out of 14 (13/14) type I and 9 out of 10 
(9/10) type III SMA, were homozygous by descent for the most closely flanking 
polymorphic markers. 

The genomic DNA of the J) patients harboring large scale deletions and 
the 22 consanguineous patients displaying homozygosity by descent were 
digested with Hind lll for Southern blotting and hybridized with probe He3. The 
11 Kb fragment revealed by probe He3 was absent In 12 out of 13 (12/13) 
consanguineous type I patients. In 2 out of 12 (2/12). the deletion also Involved 
the 3.7 Kb fragment. By contrast, the 1 1 Kb fragment was absent in 1 out of 8 
(1/8) consanguineous type III patienls only. Consistently, the 11Kb HiodHI 
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fragment was absent in 4 out of 9 (4/9) patients harboring large scale deletions 
on one mutant chromosome. Of particular interest was the absence of the 1 1 Kb 
fragment in the patient harboring a deletion of one of the two loci delected by 
markers C212 and C272. 

When analyzed together, these observations provided evidence for 
genomic rearrangements of E^" in SMA patients and supported the location of 
the SMA gene centromeric to the locus revealed by the 11 Kb [j\nd\\\ fragment, 
since all consanguineous type Ml pttienls but one were not deleted for this locus. 

In order to characterize the centromeric boundary of the genomic 
rearrangement in the disease, the allele segregation at loci detected by marker 
C272 in consanguineous SMA patients was analyzed. All consanguineous SMA 
type I patients had one single PCR amplification product, compared with 0 out of 
60 controls. This marked heterozygosity deficiency was due to deletion of one of 
the two loci detected by C272. as indicated by the marked decrease of gene 
dosage with probe 132SE11. mapping close to this marker. By contrast. 7 out of 
9 (7/9) consanguineous type III SMA patients had two C272 amplification 
products inherited from both partsnts. indicating homozygosity at each locus 
detected by marker C272. Moreovr;^r. no gene dosage effect was observed with 
probe 132SE11 indicating the absence of deletion involving the locus detected 
by C272 in type III consanguineous patients. 

Assuming that the same locus is involved in all three types of SMA. these 
results indicate that the disease causing gene is distal to the telomeric locus 
detected by C272. 

These studies place the SMA gene within the telomeric element E^"', 
between the telomeric loci detected by markers C272 and He3 (11 kb Hindlll 
fragment). Based on long-range restriction mapping using PGFE of the YAC 
contig. this critical region is entirely contained in a 140 Kb Sacll fragment of YAC 
clone 903D1 (or 150 Kb Sac ll fragment of YAC clone 920D9). 
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Atter confirming that the SMN gene was located on a 140 Kb Sacll 
fragment a phage contig spanning the critical region of the telomeric element 
was constructed in order to identify and characterize the SMN gene. 

Phage clones containing markers C212. C272. C171 and C161 were 
isolated from the X phage libraries constructed from YAC clones 595C11 and 
903D1 and used as a starting point for bidirectional walking. A phage contig 
(60 Kb) surrounding markers C212, C272 and C171 was constructed based on 
the restriction map of the phage clones (Fig. 6). 

To identify genes in the contig. the following three stategies were used : 

1 ) a search for interspecies-conscrved sequences was conducted ; 

2) exon trapping method was performed ; and 

3) direct cDNA selection was performed. The genomic probe 
132Sei1. derived from the phage containing the marker C272. gave positive 
hybridization signals with hamster DNA indicating the presence of interspecies- 
conserved sequences. The screening of a XgtIO human fetal brain cDNA library 
with probe 132SE1 1 resulted in the selection of 7 overlapping \ clones spanning 
1.6 kbp. Sequence analysis of the clones revealed a 882 bp open-reading frame 
(ORF) and a 580 bp non-coding region. A 1.5 kbp clone (BCD541) contained the 
entire coding sequence and most of the 3" non-coding region. The 3' end of the 
cDNA along with its poly(A)* tail was obtained by PCR-amplification of a 
lymphoblastoid cell line cDNA library. 

Two cDNA clones lacked nucleotides 661 to 755. suggesting that an 
alternative splicing might have occured. Northern blot analysis of poly(Ar RNA 
from various tissues including heart, brain, liver, muscle, lung, kidney and 
pancreas, revealed the presence of a widely expressed 1.7 kb transcript. The 
ORF encodes a putative protein of 294 amino acids with a predicted molecular 
weight of approximately 32 Kd. 
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A homology search using the FASTA and BLAST networks failed to detect 
any homology at either the nucleotide or the amino acid level. 

To further distinguish whether there was any duplication of the BCD541 
gene in the 5q13 region. BCD541 cDNA was used as a probe for Southern blot 



Specific hybridization with non-overlapping YACs containing either the 
E*^'" only (YAC clones 759A3, 121B8 and 278G7), or containing the E^'* only 
(YAC clone 595C11) provided evidence for duplication of the BCD541 gene. 
Each gene encompassed approximately 20 kb and displayed an identical 
restriction pattern. Evidence for head to head orientation of the two genes was 
derived from the location of the Sacll and EagI restriction sites of the non- 
overlapping YAC clones containing either E^ or E^**. following hybridization 
experiments with probes BCD541 t<nd p322 which flank the Sac ll and EaqI sites 
of each element. 

In order to look for divergences in the two copies of the BCD541 gene, the 
organization of the telomeric gene was characterized and compared to that of the 
centromeric counterpart. Genomic sequence analysis revealed that the telomeric 
BCD541 gene is composed of 8 exons (Fig, 3). However, it is now known that the 
previously known exon 2 is composed of 2 exons separated by an additional 
intron as set forth in Fig. 10, therefore the SMN gene is composed of 9 exons. 

Starting from either the centromeric or telomeric gene loci (in YAC clones 
121B8 and 595C11, respectively), PCR-amplification and sequence of each exon 
and their flanking regions revealed five discrepancies between the centromeric 
and the telomeric BCD541 genes. The first one is a conservative substitution in 
exon 7 (codon 280) specific for the telomeric (TTC) or the centromeric BCD541 
gene (TTT). The second one, located in the 3' non-coding region (exon 8 
nucleotide n^'USS) is specific for the telomeric (TGG) or the centromeric 



and PFGE analysis of YAC clones spanning the disease locus. 
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BCD541 gene (TGA). Three other single base substitutions were observed in the 
sixth and seventh introns. 

The observation of both versions of each'^exon (exon 7 and 8) on either 
YAC clones containing both gene loci (YAC clone 920C9) or the 
monochromosomal hybrid HMW105 demonstrated that these substitutions are 
neither allelic nor due to polymorphisrr^s. Band shifts on SSCP analysis of 
amplified exons 7 and 8 allowed an easy distinction of the telomeric (T-BCD541) 
and centromeric genes (C-BCD54 }) in both controls and SMA patients. All the 
unrelated healthy controls tesleo (n=76) harbored the T-BCD541 gene as 
determined by SSCP analysis of exons 7 and 8 (100%). Most of them (89.3%) 
also harbored the C-BCD541 gene but 8 out of 75 (8/75) (10.7%) lacked the 
C-BCD541. 

A total of 230 SMA patients were tested for single base substitutions 
detected in exons 7 and 8 by SSCP method after PCR-amplification of genomic 
DNA. Among them. 103 belonged to type I. 91 to type II. and 36 to type III. 
Interestingly. 213 out of 230 SMA patients (92.6%) lacked the T-BCD541 gene 
on both mutant chromosomes <:o/npared with 0 out of 75 controls (0%). 
Moreover. 13 out of 230 SMA paiients (5.6%) lacked the T-BCD641 gene for 
exon 7 on both mutant chromosomes but retained the T-BCD541 gene for exon 8 
compared with 0 out of 75 controls (0%). Finally, only 4 out of 230 SMA patients 
(1.7%) harbored the T-BCD541 gene as determined by SSCP analysis of exons 
7 and 8. 

These results show that the T-BCD541 gene is either lacking or truncated 
in 98% of SMA patients. In addition, these data support the view that the disease 
gene is located between the telomeric locus detected by C272 and exon 8 of the 
T-BCD541 gene. Therefore, according to the overlapping restriction map of the 
phage contig. the critical region is entirely contained in 20 kb. suggesting that 
the telomeric BCD541 gene is the chromosome 5 SMA-determining gene. 
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In order to demonstrate that the T-BCD541 gene is responsible for SMA, 
point mutations in the 4 SMA patients in whom no rearrangement of the T- 
BCD541 gene had been observed were searched. Direct sequencing of PGR 
amplification products of each exon with their flanking regions was performed in 
the four patients. 

A 7 bp deletion in the 3' splice acceptor site of intron 6 (polypyrimidine 
tract) was found in patient SA. Sequence analysis of exon 7 flanking the deleted 
intron. recognized the sequence specific for the T-BCD541 gene. Moreover, the 
non-deleted PCR-product corresponding to the same region, harbored the 
sequence specific for the C-BCD541 suggesting that the other mutant allele 
{ lacked the T-BCD541 gene. 

j In patient Bl. a 4 bp deletion in the 5' consensus splice donor site of 

I intron 7 was found. This deletion occured on the T-BCD541 gene as determined 

; by sequence analysis of the flanking exon 7. 

\ In patient HU. a point mutation in codon 272 (TAT-»TGT) was found. This 

i mutation changed a Tyrosine to Cysteine. The patient was hetenazygous for the 

mutation, presumably carrying a different SMA mutation on the other allele. All 
three mutations observed in patients SA, HU and Bl were not detected in 100 
normal chromosomes ruling out rare polymorphisms. 

A different splicing of exon 7 distinguished the C-BCD541 from the T- 
BCD541 gene using reverse transcription-based PGR. Eleven SMA patients 
were selected for the analysis of their transcripts by Northern blot or reverse 
transcription-based PGR amplification. Eight of them belonged to type I. 1 to 
type li and 2 to type III. SSCP analysts of genomic DNA showed an absence of 
T-BCD541 gene in 10 patients and one patient (SA) had G-BGD541 and T- 
BGD541 genes for both exons 7 and 8. Six unrelated controls who harbored both 
C-BCD541 and T-BCD541 genes ar\d 2 controls with only T-BCD541 gene were 
included in the present study. 
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The expression of this gene in lymphoblasls made it possible to analyze 
the BCD541 transcripts in cell lines derived from controls and SMA patients. 
Northern blot analysis of RNA from lymphoblastoid cell lines showed the 
presence of a 1.7 kb mRNA in all samples. None of the SMA patients showed a 
transcript of altered size. It was observed that a reduced level of transcripts was 
obtained when compared to the expression of the p-actine gene in 3 out of 4 
type I SMA patients. Normal mRNA level were found for the other SMA 
probands. 

Since the Northern blot analysis revealed the presence of a transcript in 
SMA patients who had the C-BCD541 gene only for both exons 7 and 8 as 
determined by SSCP analysis, these results indicated that both C-BCD541 and 
T-BCD541 genes were expressed. To prove whether both BCD541 genes were 
expressed. RT-based PCR amplinc^tion of RNA isolated from the lymphoblastoid 
cell lines from controls and SMA patients was used. Direct sequencing of PCR 
products flanking exons 7 and 8 revealed that patients who had C-BCD541 only 
displayed the sequence specific for the C-BC0541 gene. Controls who had both 
T-BCD541 and C-BCD541 genes, had two types of transcripts corresponding to 
both BCD541 genes. These results confirmed that both genes were expressed. 
In addition, 2 alternative splicings involving exon 5 or exon 7 that resulted in 
different transcripts were observed. The alternative splicing of exon 5 confirmed 
previous sequence data on the cDNA clones. 

The analysis of the RT-PCR amplification products encompassing exons 6 
to 8 showed that the spliced transdript keeping exon 7, was present in controls 
who had both C-BCD541 and T-BCD541 genes or controls who had the T- 
BCD541 gene only. Conversely, the allernative spliced transcript lacking exon 7 
was observed in controls who had both genes, but not in controls who had the T- 
BCD541 gene only. These results indicated that the allernative spliced transcript 
lacking exon 7 was derived from the C-BCD541 gene only. 
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The transcript analysis of patient SA harboring a 7 bp deletion of the 3" 
splice acceptor site of Intron 6 of the T.BCD541 gene revealed the presence of 
both spliced transcript keeping exon 7 and alternate spliced transcript lacking 
exon 7. Moreover, the sequence analysis of amplification products from the 
spliced transcript keeping exon 7. showed a sequence specific for the C-BCD541 
gene (Fig. 2). These results demonstrated that the 7 bp deletion of intron 6 
observed in patient SA was deleterious for the correct splicing of exon 7 of T- 
BCD541 gene only. In addition, because a differential splicing of exon 7 allowed 
one to distinguish the 2 BCD541 genes, this difference was analyzed among 
controls and SMA patients including patient SA. In controls, the amount of 
alternated spliced transcript lacking exon 7 was less abundant than that of 
i spliced product keeping exon 7. Conversely, in SMA patients, the amount of 

!l alternated spliced transcript lacking exon 7 was equal or more abundant than 

i that of spliced product keeping exon 7. 

3 These results provide evidence for a difference between controls and 

SMA patients at the transcription ievel of these genes. The alternative spliced 
J transcript lacking exon 7 resulted in a shorter ORF with a different C-terminus 

; protein that might have effects on the protein function. 

5 To further characterize the entire structure and organization of the human 

^ SMN gene, three genomic clones were isolated from a FIX li phage library 

derived from YAC clone 595C11 and screened with the full-length BCD541 
cDNA (Fig. 2A) as a probe. After selecting several clones that hybridized to the 
probe, restriction mapping and Southern blot analysis indicated that phages L- 
132, L-5 and L-13 spanned the entire SMN gene. 



These three phage clones v.sre further subjected to sequencing using the 
Maxam-Gilbert or Sanger et al methods of sequencing disclosed in Sambrook et 
al supra. 
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The nucleotide and amino acid sequence of the entire SMN gene 
including exons and introns is set forth in Figure 10. The human gene is 
approximately 20 kb in length and consists of nine (9) exons interrupted by 
8 introns as shown in Figure 10. The human SMN gene has a molecular weight 
of approximately 32 kDA. 

Although it was thought that only one exon 2 was present in the SMN 
gene (see. Lefebvre et al. Cell. 80:155-165 (1995)), the sequencing data proved 
othenA/ise and the previously mentioned exon 2 in Lefebvre et al supra is in fact 
composed of 2 exons separated by an additional intron, as illustrated in 
Figures 9 and 10. To avoid confusion in the renumbering of exons, the 2 exons 
in exon 2 are now referred to as exon 2a and exon 2b. 



All exon-intron bounderies displayed the consensus sequence found in 
other human genes and a polyadenylation consensus site is localized 550 bp 
downstream from the slop codon (Fig. 10). 



Starting from either YAC clones 121B8 or 595C11 (which contain the C- 
BCD541 and SMN genes respectively, (see, Lefebvre et al, s upra ) PCR 
amplification and sequence analysis of the introns showed three differences 
between SMN and C-BCD541 ir\ addition to those previously described (by 
Lefebvre et al, supra ). These included a base change in intron 6 (-45bp/exon 7. 
atgt, telomeric; atat, centromeric) and two changes in intron 7 (+100bp/exon 7. 
ttaa, telomeric; ttag, centromeric and at position +214bp/exon 7. Uat, telomeric; 
ttgt, centromeric. Figure 10). The presence of both versions in a YAC clone 
containing both genes (YAC 920C9). and in the control population demonstrated 
that these substitutions are locus- specific rather than due to polymorphism. 
Band shifts on single strand conformation polymorphism (SSCP) analysis of the 
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PGR amplified inlron 7 allowed SMN and its centromeric counterpart (C- 
BCD541 ) to be readily distinguished (see. Figure 1 5). 

In order to identify sequences potentially important for promoter function, 
the organization of the region surrounding exon 1 of the SMN and C-BCD541 
genes was characterized. Based on restriction mapping. Southern blot 
hybridization and PGR amplification, exon 1 and the G272 marker (D5F150S1. 
D5F150S2) were located in the same BfllM-gcoRI restriction fragment of L-132 
3 phage (Figure 9). PGR amplification using the C272f primer and a reverse primer 

I chosen in exon 1 was performed and the amplified product was directly 

sequenced. Sequence analysis showed that the (GA) repeat corresponding to 
the G272 marker are located 463bp upstream from the putative ATG translation 
start site (Figure 1 1 ). Gomparative sequence analyses showed no discrepancy 
between the 5' ends of the SMN gene and its centromeric counterpart (C- 
BGD541). In addition, sequence analysis showed the presence of putative 
binding sites for the following transcription factors: AP-2, GH-CSE2. DTF-1. 
E4F1. HiNF-A. H4TF-1. p-IFN. Sp1 (Figure 11 ; Faisst et al. Nucleic Acids Res.. 
20.3-26(1992)). 

Besides isolating and characterizing the human SMN gene, the mouse 
homologue of the SMN gene was also cloned. Cross-species consen/ation of 
human SMN gene with rodents has been shown in Lefebvre et al. s upra and 
served to isolate the mouse SMN gene. Screening of a mouse fetal cDNA library 
using human SMN cDNA as a probe allowed the isolation of 2 overlapping 
mouse cDNA clones. Sequence enalysis of the clones revealed an 864 bp open- 
reading frame (ORF) (Fig. 12). The ORF encodes a putative protein of 288 
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amino acids (Fig. 12) with an homology of 83% with human SMN amino acid 
sequence (Fig 13). 

Either the isolated human or the mouse SMN. the gene can be inserted 
into various plasmids such as pUCl8. pBr322. pUCIOO. XgHI. X18-23. XZAP. 
XORF8. and the like. The methods for inserting genes into different plasmid 
vectors are described by Sambrook et at Si^ra. Various microorganisms can be 
used to transform the vector to produce the SMN gene. For example, host 
microorganisms include, but are not limited to. yeast. CHO cells. E^coH. Bacillus 
subtili s and the like. 

;i Once recomblnantly producea. the human SMN protein or the mouse SMN 

K protein can be further purified from the host culture by methods known in the art. 

11 Besides recombinantly producing the SMN protein, the present invention 

ilj also relates to the production of polyclonal and monoclonal antibodies. These 

methods are known in -the art as evidenced by Sambrook et al sym^- The 
monoclonal antibody can be obtained by the procedure of Kohler and Milstein. 
a Nature. 256:495 (1976) ; Eur. J. immunoL. 6:511 (1976) or Harlow and Lane 

Jji ^tibodies. a Laboratory Manual. Cold Spring Harbor Laboratory. Cold Spring 

m Harbor. New York (1988). and can be used, for example, in diagnosing SMA, as 

well as other motor neuron disorders. 

Polyclonal rabbit antisera can also be generated against synthetic 
peptides corresponding to any part of the SMN aminp acid sequence including 
the amino terminus and carboxy terminus. More specifically, the following 
peptides were synthesized based on the amino acid sequence set forth in 

Figure 1 : Cl^'^ "^-^^ ""^"^ 



AO!?) 



N-terminal G G V f M - Q E D S V L F R R G T^ C-terminal -pp , 

SRSPGNKSDNIKP K^«^9«' A*^« [T^ -^"^^ 
FRQNQKEGRCSHSLNrv?-?; ^ 2<lro-z9 P 4 ^^^"^ 
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The synthetic peptide may be coupled to a carrier protein such as Keyhole 
limpet hemocyanin (KLH) through an amino- or carboxy-artificial cysteine residue 
that may be synthetically added to the desired sequence. The cysteine residue is 
used as a linker to couple the synthetic peptide to the carrier protein. The 
procedure utilized to couple synthetic peptides to KLH is described by Green et 
al. Celi. 28:477 (1982). 

Approximately, 60-100 tjg. preferably 100 pg of synthetic antigen is 
dissolved in buffer and emulsified with an equal volume of Freund's complete 
adjuvant. About .025 ml to 0.5 ml of emulsified antigen-adjuvant can be injected 
intramuscularly or intradermaly into a rabbit. Four to six weeks later, the rabbit is 
boosted and 20-40 ml of blood is drawn 7-10 days after each booster injection. 
The serum is then tested for the presence of antigen using RIA, ELISA or 
immunoprecipitalion. The positive antibody fractions may then be purified, for 
example by absorption to protein A following the method of Goudswaald et al. 
Scand. J. Immunol.. 8:21 (1978). 

More specifically, about 20 to 50 pg of antigen, prepared either by the 
recombinant techniques set forth above or synthetically made antigen is diluted 
in about 100 pi of buffer and emulsified with an equal amount of Freund's 
complete adjuvant. About 30-60, preferably 50 pi of the emulsified antigen- 
adjuvant is injected subcutaneously at four sites into mice. Four to six weeks 
later, the mice are boosted with an intraperitoneal injection of about 100 pi 
containing 5-10 pg of antigen solubilized in buffer. The mice are bled from the 
mediam tail vein 7-10 days atter the boaster injection and the serum is tested for 
antibody using standard methods. Blood is then drawn every 3-4 days until the 
antibody titer drops. 

Tissue, plasma, serum, cerebral spinal fluid and the like can be used to 
delect SMA disease using the above-described monoclonal or polyclonal 
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antibodies via Western bJot (1 or 2 dimensional) or ELISA. These methods are 
known in the art as described by Sambrook et al. sunra. 

A method for detecting SMA as well as in ALS. ACM. and PLS patients 
who possibly have these motor neuron disorders, is also encompassed by the 
present invention. This method involves extracting from a patient suspected of 
having SMA. DNA from a sample. This sample may include sera, plasma, 
cerebral spinal fluid and the like. Aftfjr extracting the DMA by known methods in 
the art. primers that are derived from exons 7 and 8 of the SMN gene are used to 
amplify the DNA. 

After amplification with the primer, the amplified product is subjected to 
SSCP (Single Strand Conformation Polymorphism). 

The gels are then subjected to autoradiography to determine if SMA is 
present in the sample. 

More specifically, it has recently been discovered that in twelve cases of 
arthrogryposis multiplex congenita (AMC) associated with SMA. 6 out of 12 
patients lacked the SMN gene. 

A total of twelve unrelated patients including eight males and four females 
of various geographic origins was selected for the study. The patients were 
chosen based on the criteria that these patients had : 

(1) congenital joint contractures of at least two regions of the body 
(see. Stern . JAMA . 81:1507-1510 <1923)) ; 

(2) generalized muscle wtiakness with muscular atrophy and areflcxia 

without extraocular involvement ; 

(3) electromyographic studies showed denervation and diminished 

motor action potential amplitude ; and 
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(4) muscle biopsies consistent with denervation with no evidence of 
storage material or other Mructurol obnormQiitica (ace. Munoot. Ncuromucculor 
Disorders. 1:81 (1991)). 

The study corisibted uf DiiiucleuUde Repeal PwlyniwiK''"*'!^ Analysis and 
SMN gene analysis (see. Examples) based on DNA extracted from peripheral 
blood leukocytes, lymlphoblasloid cell lines or muscle tissue in all twelve 
patients. 

The data from this study is suriiriiaii^ed in Table 1 below. 

The diagnosis was maae at binn witn an uniform phenotype uiaiacieiUeU 
by a severe hypotonia, absence of movements except extraocular mobility and 
contractures of at least two joints. The number of affected joints and the severity 
of the postural defects varied from infant to Infant, as set forth in Table 1 . 
Decreased fetal movements were noted in 7 out of 12 (7/12) patients. Neonatal 
respiratory distress was observed in 9 out of 12 (9/12) patients and facial 
involvement associated with microjinathia was noted in 4 out of 12 (4/12) 
patients. Most of the cases. 8 out of 12 (8/12). died within the first month of life. 
Four infants are still alive. No family history was noted except in family 12 in 
which both the child and her father were affected suggesting an autosomal 
dominant form of AMC 



Table 1 shows that the SMN gene was lacking on both mutant 
chromosomes in 6 out of 12 (6/12) patients (cases 1-6). Among them. 3 out of 6 
(3/6) patients had a large inherited deletion involving both loci detected by 
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markers C212 and C272 on one parental allele, the other parental carrying only 
one locus instead of the expected two. as shown in Figure 14. 

Analysis of SMN exons did not reveal intragenic mutations in the patients 
whose SMN gene showed no deletions (cases 7-12). Genetic analysis showed 
that the disease gene in a family (case 9) was not linked to chromosome 5q13 as 
both the affected and healthy siblings carried the same 5q13 haplolype. These 
data strongly suggest that the patients whose SMN gene showed no deletions 
were not linked to the 5q13 SMA locus (cases 7-12). 

Hitherto, arthrogryposis was regarded as an exclusion criterion in SMA 
(sec, Munsat. supra). But the observation of SMN gene deletion in 6 out of 12 
(6/12) patients (50%) strongly indicates that arthrogryposis of neurogenic origin 
is related to SMA and that this subgroup and SMA are allelic disorders. Yet. 
AMC of neurogenic origin is a genetically heterogeneous condition since the 
disease gene was not linked to SMN locus in 6 out of 12 (6/12) patients. 
Exclusion of chromosome 5q has also been shown in one family with two AMC- 
SMA patients, as described by Lunt et al. J, Med. Genet.. 29:273 (Abstract) 
(1992). 

Thus, by dinucleotide Repeat Polymorphism Analysis and SMN gene 
analysis, clinical diagnosis of AMC can be confirmed by the absence or 
interruption of the SMN gene. The present invention now provides methods to 
detect AMC either in live patients or in utero. 

Yet another embodiment of the present invention is the detection of SMA 
using specific oligonucleotide probes based on the nucleotide sequence set forth 
in Figures 3, 10. or for the mouse SMA Figure 12. If a patient totally is lacking in 
the SMN gene, no hybridization tc the specific probe will occur. The hybridization 
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conditions may vary depending upon the type of sample utilized. It is preferable 
to conduct such hybridization analysis under stringent conditions which are 
known in the art and defined in Sambrook et al syera. The oligonucleotide 
probes may be labeled in any manner such as with enzymes, radioactivity and 
the like. It is preferable to use radiolabeled probes. 

In another embodiment of the present invention, the human SMN gene 
can be utilized in conjunction with a viral or non-viral vector for administration jn 
vivo directly to the patients suffering from SMA or related motor neuron diseases 
or by administration in_yitro in bone marrow cells, epithelial cells f.broplasts, 
followed by administration to the patient. See. for example Resenfeld et al. 
Science (1 991 ) 252 . pp. 431 to 434. 

The present invention provides a method of detecting SMN gene defects 
or the total lack of the SMN gene in a fetus. Amniotic fluid taken from the 
pregnant woman is subjected to SSCP analysis according to the methods of the 
present invention. 

In order to further illustrate the present invention and advantages thereof, 
the following specific examples are given, it being understood that the same are 
intended only as illustration and in nowise limitative. 
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EXAMPLE S 



EXAMPLE 1 

Construction of phage libraries from the 121B8. 595C11, and 903D1 
YAC clone. 

Total yeast DNA from YAC clone 595C1 1 contaiinng the telomeric loci 
delected by C212. C272 and C161. or YAC clone 121 B8 containing the 
cenlromeric loci delected by the same markers or 903D1 YAC clone containing 
both loci was purified and partially digested with Sau3A. DNA in the size range 
of 12 to 23 kb was excised after 0.5% Seaplaque GTG agarose get 
electrophoresis and precipitated with etnanoi aner p-agarase uigcsiion. Ari«t 
partial flll-in of the Sau3A site. DNA was subcloned at the partially filled 2<hol site 
of bacteriophage FIXIII (Stratagene). Clones of X containing the microsatellite 
DNA markers C212 (L-51). C272 (L-51. L.132). C171 (L-5. L-13). C161 (595B1). 
11M1 (L-11). AFM157xd10 (L-131) were digested either with EcoRI or Hmdlll or 
both and subcloned into pUC16 plasmid vectors. Subclones from phages 
containing markers C212(p322). C272(132SE11). C161(He3). 
AFM157xd10{131xb4) and CMS1(p11M1) were used as probes. 



EXAMPLE 2 

Pulsed field gel electrophoresis analysis 

High molecular weight DNA was isolated in agarose plugs from Epstein- 
Barr virus transformed lymphoblastold cell lines established from controls and 
patients or from YAC clone as described. Plugs were rinsed twice for 30 min. 
each in 10-20 min vol. TE. The plugs were equilibrated for 30" at 4-C with 0.3 ml 
of the appropriate restriction enzyme buffer containing 0.1 mg/ml BSA 
(Pharmacia). Excess buffer was \.hen removed and the plugs wore incubated at 
the appropriate temperature for 15 h with 40 U restriction enzyme per reaction. 
DNA was digested with the restriction enzymes BssHII, Eagl. §fil, Sacl. Kpnl. 



13 
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Sacll. Spe l. Separation of DNA fragments was performed using a CHEF-III-DR 
PFGE apparatus (Biorad). Fragments from 50 to 1200 kb were separated by 
electrophoresis through 1% agarose Seakem. at 200 V for 24 h at ^A''C in 
0.5 XTBE running buffer using a 30' to 7a ramping pulse time. The separation of 
fragments from 5 to 100 kb was performed by electrophoresis at 200 V for19 h 
at 14-0 in 0.5 x TBE buffer using a 5* to 20* ramping pulse time. After treatment 
with 0.25N HCI for 20 min. pulsed field gels were blotted onto Hybond N+ Nylon 
membrane (Amersham) in 0.4N NaOH. 0.4M NaCI for 20 h. Probes were 
successively hybridized to the same filters to ensure accurate data. 
Hybridizations were performed as described. 



EXAMPLE 3 

YAC library screening 



YAC libraries from CEPH were screened by PGR with microsatellites 
C212. C272. C171. CMS1. and C161. YAC genotypes were established by 
electrophoresis of PGR products on denaturing polyacryiarnide gels. YAG size 
was estimated by pulsed field gel electrophoresis. 



EXAMPLE 4 
Southern blot analysis 



DNA samples were extracted from either peripheral blood leukocytes or 
lymphoblastoid cell lines. DNA were digested with restriction enzymes EcoRI. 
Hind lll. Bglll. Xbal. Pyull. Xmnl. Rsa'. Estl. fiamHI. separated by electrophoresis 
on an 0.8% agarose gel for Southern blotting and hybridized with radioactively 
labeled probes. 
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EXAMPLE 5 

Dinuclootide repeat polymorphisms 

Genotyplc data were obtained for the C212(D5F149S1, -S2). 
C272(D5F150S1, -S2) and C161(D5F153S1. -S2) dinucleotide repeat 
Amplification conditions were as follows : denaluration at 94*C, annealing at 
55°C, and extension at 72*'C, 1 min each for 30 cycles. The procedure used for 
detection of dinucleotide repeat polymorphisms has been described elsewhere. 



EXAMPLE 6 

^3 cDNA clone and DNA sequencing 

Is? 

□ Two million recombinants of a XgtIO human fetal brain library were plated 

according to the manufacturer (Cfontech). Prehybridization and hybridization was 
W carried out in 10% Dextran Sulphate Sodium, 1 M NaCI, 0.05 M Tris-HCI pH 7.5. 

0.005 M EDTA and 1% SDS with 200 mg/ml sheared human placental DNA 
=3 (Sigma) for 16 hours at 65X. The filters were washed in 0.1 X SSEP-0,1% SDS 

=2 at 65**C and autoradiographs were performed for 24 hours. The DNA of positive 

IJ cDNA clones were purified, digested with E_co Rl and subcloned in Ml 3 

J; bacteriophage. Single strand DNAs were sequenced using the DyeDeoxy^"^ 

Terminator Cycle Sequencing Kit protocol supplied by Applied Biosystems. Inc. 
and analyzed on a ABI model 373A DNA automated sequencer. To obtain the 3' 
end of the cDNA along with its poly(A)* tail. PCR-amplification of a 
lymphoblastoid cell line cDNA library was performed using specific primer 
complementary to the 3' end of the clones and primer specific to the vectors 
arms of the cDNA library as previously described (Fournier 8,. Saudubray J.M., 
Benichou B. et al. 1994, J. Clin. U wesX, 94:526-531). Specific PCR-products 
were directly sequenced with both primers using the DyeDeoxy^*^ Terminator 
Cycle Sequencing Kit protocol supplied by Applied Biosystems, Inc. and 
analyzed on a ABI model 373A DNA automated sequencer. 
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EXAMPLE 7 

Isolation of RNA and Northern blot analysis 

mRNA from lymphoblast cell lines of controls and SMA patients were 
isolated with the QuickPrep mRNA purification kit (Pharmacia) according to the 
supplier's procedure. Total RNA was prepared following the single-step RNA 
isolation method described by ChomczynskI and Sacchi (Analytic Bi ochemistry. 
162 :156*159 (1987)). The total RNA preparation was treated with RQI-DNAse 
(Promega) to remove any contaminating genomic DNA. Northern blots were 
made from mRNA and total RNA by electrophoresis through 1.5% seakem 
agarose gel containing methyl mercuric hydroxide and transferred to positively 
charged membrane in 20 X SSC and heated for 2 hours at SC'C. ^^P- 
radiolabeled DNA probes were synthesized by a random priming method 
according to the manufacturer (Boehringer). and hybridized in a solution 
containing 6X SSEP, 1% SDS. 5X Oenhardt's for 16 hours at 65X- The 
membranes were washed to a final stringency of 0.1 X SSEP. 0.1% SDS at 65*C 
for lOmin. Autoradiography was at -80**C with intensifying screens and Kodak 
XAR films for 2 to 10 days. The amount of mRNA was normalized with a b-actine 
cDNA probe. The autoradiographs were scanned at 600 nm in computerized 
densitometer (Hoeffer Scientific Instruments. San Francisco). A Northern blot 
with polyA+ RNA from several huma tissues was purchased from Clontech. 

EXAMPLE 8 

Reverse transcriptase-based PGR amplification and sequencing 

Each PGR amplification was carried out in a final volume of 20 ml on 
single-strand cDNAs synthesized from the random hexamers-primed reverse 
transcription (Promega). The PGR reactions included 2 picomoles of forward and 
reverse primers and 1 unit Tag polymerase in the reaction buffer recommended 
by Perkin Elmer/Getus. Parameters for PGR amplification consisted in 1 min at 
94"G, 1 min at 55**G and 1 min at 72*G for 30 cycles followed by a final extension 
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period of 10 min at 72**C. Parametors for PGR amplification consisted in 1 min at 
94*C. 1 min at 55**C and 1 min at 72X for 30 cycles followed by a final extension 
period of 10 min at 72*C. The PGR products NA^re cut from acrylamide gel and 
eluted in 100 ml of TE buffer. The diluted fragments were reamplified with the 
same primers prior direct sequencing. The PGR amplification products were cut 
from acrylamide gel and eluted in 100 ml of TE buffer. The diluted fragments 
were reamplified prior to direct sequencing with both primers using the 
DyeDeoxy^*^ Terminator Gycle Sequencing Kit protocol supplied by Applied 
Biosystems. Inc. and analyzed on a A8I model 373A DNA automated sequencer 
Six sets of primers along the cDNA sequence were used to amplify DNA 
products for sequence analysis. 

EXAMPLE 9 

Computer-assisted Analysis 

Sequence homology analysis with both nucleotide and protein sequences 
from 541 C were performed using FASTA and BLAST through the GJTI2 French 
network (Dessen P.. Fondrat G.. Velcncien C., Mugnoer G.. 1990. CABIOS; 
6:355-356). 

EXAMPLE 10 
SSCP Analysis 

For single strand conformation polymorphism (SSGP) analysis, DNA from 
peripheral leukocytes (200 ng) was submitted to PGR amplification using 
unlabelled primers (20 pM) in 25 pi amplification mixture containing 200 pM 
dNTPs. 1 unit of Jag polymerase (Gibco-BRL) and 0,1 pi of a ^P dCTP 
(10mGi/ml, NEN). Amplified DNA was mixed with an equal volume of formamide 
loaded dye (95% formamide, 20 mM EDTA. 0.05% bromophenol blue. 0.05% 
xylene cyanol). The samples (5pl) v.'cre denatured for 10 mn at 95*'G and loaded 
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onto a polyacrylamido gel (Hydroling MED. Bioprobe) and electrophoresed at 
4-C for 18 to 24 hours at 4W. Gels were transferred onto 3 MM Whatman paper, 
dried and autoradiographed with Kodak X-OMAT films for 24 hours. To amplify 
^ the DNA sequence containing the divergence of 

^ (5- AGACTATCAACTTAATTTCTGATCA and >^541C770 

(S-TAAGGAATGTGAGCACCTTCCTTC yj^rV Kxs^^i. To amplify the ONA 

_ sequence containing the divergence ^.«l^?r^^^ 

^ (5> GTAATAACCAAATGCA^T^GTGAA^^-);^^ and 541C1120 

(5- CTACAACACCCTTCTCACAG 3| were used. 



gXAMPLE 11 

Cloning of the human SMN gene 



Total yeast DNA from YAC clone 595C11 was purified via the method of 
Sambrook et al supra and partially digested with restriction enzyme Sau3A. DNA 
in the 12-23 kD size range was excised after 0.5% sea plague GTG agarose gel 
electrophoresis and precipitated with ethanol after p-agarase digestion. After 
partial fiU-ln of the Sau3A site. DNA was subcloned at the partially filled Xhol site 
of bacteriophage FIXII (Stratagene). 

The full-length BCD541 cDNA was used as a probe to screen the FIXII 
phage library under conditions set forth in Sambrook et al. sum^- 

These phages, named M-1 32. L-5 and L-1 3 spanned the entire SMN gene 
as confirmed by restriction mapping using Hindm. EcoRI and Bgill (see. Fig. 9) 
and Southern blot analysis. 

The phages were then sequenced as described in Example 8. Once the 
gene was sequenced, it was then cloned into a pUC18 vector and recombinantly 
reproduced in large quantities that were purified for further use. 
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EXAMPL E 

Cloning of th« mouse SWN gene 



A mouse fetal cDNA library was screened using the coding sequence of 
the human SMN cDNA as a probe according to Sambrook et al. sum^. 

Two overlapping mouse cDNA clones were found that had the entire 
sequence of mouse SMN, as revealed by sequencing methods described in 
Example 8 after being cloned into a pUC18 vector and M13 vectors. 



EXAMPLE 1 3 
Transgenic mouse 

Transgenic mice containing multiple normal SMN genes OR SMN genes 
lacking exon 7 are produced by the methods according to Lee et al. Neuron. 13; 
978-988 (1994). The transgenic animals are then tested and selected for the 
overexpression of the SMN gene or SMN gene lacking exon 7 via Southern, 
and/or Northern blots using the probes described in the present invention or by 
screening with antibodies described in the present invention in a Western blot . 

Transgenic mice containing abnormal SMN genes are obtained by 
homologous recombination methods using mutated SMN genes as described by 
Kuhn et al. Science . 269; 1427-1429 (1995) and Bradley. Current Opinion in 
Biotech nology. 2; 823-829 (1991). The transgenic animals are then tested and 
selected for the overexpression of the SMN gene via Southern, and/or Northern 
blots using the probes described i.-. the present invention or by screening with 
antibodies described in the present invention in a Western blot selected for the 
abnormal SMN gene. 
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EXAMPLE 14 
Polyclonal antibodies 

100 pg of a synthetic antigen having sequence : , ^ ^? C<5<B 

N-terminal GGVPEQEDSVLFRRGT^ C-terminal 

was dissolved in buffer and emulsified with an equal volume of Freund's 
complete adjuvant. 0.5 ml of the emulsified synthetic antigen-adjuvant vi^as 
injected intramuscularly into a rabbit. Five weeks later, the rabbit was boosted 
and 20-40 ml of blood was drav^ 8 days after each booster injection. The serum 
was then tested for the presence of'antigen using RIA. 

Polyclonal antibodies were also prepared by the same methods using the 
following sunthetic antigens : /.V n r za, -r>A >t/5 

N-terminal SRSPGNKSDNIKPKa C-terminal ^ 
PRnMOKEGRCSHSLN C AM^^ 1,^0'^^^ (r 



EXAM PLE 1§ 
Gene Therapy 



Using the adenovirus construct described by Ragot et al. Nature. Vol. 361 
(1993). the normal SMN gene was inserted therein and injected intramuscularly 
into a patient lacking this gene. The patient is monitored using SSCP analysis as 
described in Example 10 above. 



While the invention has been described in terms of various preferred 
embodiments, the skilled artisan will appreciate that various modifications. 
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substitutions, omissions and changes may be made without departing from the 
spine thereof. Accordingly, it is intended that the scope of the present invention 
be limited solely by the scope of the following claims, including equivalents 
thereof. 
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